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(57) ABSTRACT 

A server system on a network such as the World Wide Web 
aggregates and stores information about the content of one 
or more Web pages served by a server on a network. A server 
process receives one or more requests for one or more Web 
pages from one or more requesters connected to the network. 
Upon receiving the request, the server produces a requested 
Web page and serves the Web page to the requester. The Web 
pages have with one or more content elements in addition to 
one or more metadata entries. One or more of the metadata 
entries are associated with the content elements of the 
respective Web page produced. The system also creates and 
maintains a log having a plurality of records. Each record 
has one or more requester fields and one or more metadata 
fields. A logger process stores the metadata entries contained 
in each of the Web pages in one or more of the metadata 
fields, and stores a requester identification, associated with 
the requester, in the requester field of the record associated 
with the respective Web page. It comprises two parts: the 
Web page description field and the hyperlink field list. 

18 Claims, 12 Drawing Sheets 
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SYSTEM AND METHOD FOR COLLECTING 
AND ANALYZING INFORMATION ABOUT 
CONTENT REQUESTED IN A NETWORK 
(WORLD WIDE WEB) ENVIRONMENT 

FIELD OF THE INVENTION 

This invention relates to collecting and analyzing infor- 
mation about the content requested and provided in a 
networking environment. More specifically, the invention 
relates to collecting and analyzing information about the 
content requested from users and provided by a server in an 
e-commerce application on a network such as the Internet. 
In particular, the invention relates to collecting such data 
over a period of time in a log record which can be subse- 
quently used in aggregation and analysis. 

BACKGROUND OF THE INVENTION 

The Internet, fueled by the phenomenal popularity of the 
World Wide Web, has exhibited exponential growth over the 
past few years. It has gone from being a communication 
route primarily for scientists, researchers, and engineers to 
an essential information exchange vehicle for broad seg- 
ments of the populace, including consumers, marketers, 
educators, children, and entertainers. Over one billion Web 
pages currently exist on the Internet, and over 40 million 
users read and interact with them. As the Internet's com- 
mercial value is recognized, numerous companies and orga- 
nizations are experimenting with electronic commerce (also 
referred to as e-commerce), the buying and selling of goods, 
information, and services over the Internet (see, for example, 
IBM's Web site for Macy's at http://www.macys.com). And 
as more and more of these companies demonstrate the 
financial viability of electronic commerce, there has been 
increasing momentum to develop sites that transact business 
over the Web. 

Any Web site owner needs to know whether the Web site 
effectively serves its intended purpose; that is, how many 
people visit the Web site, who these people are, what they 
want, and what they do at the site while they are there. This 
is particularly true of the domain of electronic commerce. 
The ability to analyze and understand traffic flow, the way 
customers navigate from page to page in a site, is critical for 
successful product marketing and sales. 

The major source of user activity data available today is 
the Web server log. A Web server is the computer that sends 
World Wide Web documents to browsers upon request. The 
Web server log is a low-level, technical account of Web 
server activities and is generated by all commonly used Web 
servers. The Web server log consists of a file containing an 
entry for each Web page served, showing the IP (Internet 
Protocol) address of the client (the machine of the user who 
is visiting the Web site using an application); a timestamp, 
indicating the exact date and time on which the visit 
occured; the URL (Universal Resource Locator) of the 
requested page, the referrer URL (the URL of the page that 
the user clicked on to get to the current page), the browser 
type, and the number of bytes transferred. 

Various commercial products and freeware packages (for 
example, Accrue 's Insight, Andromedia's ARIA. e.g. Soft- 
ware's WebTrends, and Aquas's Bazaar Analyzer) use Web 
server logs to analyze Web server and user activities and 
generate reports. Examples of the kind of information that is 
typically reported are the number of visitors at a Web site 
during a given time period, the most and least frequently 
visited pages, the most frequent entry pages (the first page 
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a user visits during a session at a Web site), the most frequent 
exit pages (the last page a user visits during a session at a 
Web site), and the visitor demographic breakdown based on 
IP address and browser type. The URLs in the Web server 

5 log often contain special user identifiers obtained by using 
"cookies". A cookie is a piece of information shared between 
a user's Web browser and a Web server, originating as a 
message sent by a Web server to the Web browser visiting 
the server's site, subsequently stored in a text file on the 

30 user's hard drive, and sent back to the server each time the 
browser requests a page from the server. From the sequence 
of URLs in the Web server log and the associated cookies, 
it is possible to reconstruct the URL paths that individual 
users traverse, and from this obtain the most frequently 

15 traversed paths through the Web site. 

Some Web sites in the past have used clever methods for 
collecting more data about user behavior by using a form of 
URL rewriting. They tag extra data about the user/requester 
to URLs of the served Web pages, so that the extra data of 

20 their analysis needs will be found in the server log. This 
method is usually used for adding user-related data (e.g., 
user-id and session-id). 

Advertising banner services have developed an interest- 
ing way to measure not only who clicked on their banners, 

25 but who saw the banners. These figures not only are used to 
calculate the rate charged for the banners, but also the 
effect iveness of the banner, known as the conversion rate. 
The conversion rate is found by dividing click-thoughs by 
impressions (the number of times that the banner was served 

30 and hopefully seen). Currently the prior art is able to 
determine conversion rates only for specific types of advert- 
ing banners. Some Web advertising services (e.g., Real 
Media's Open AdStream) record impressions and click- 
throughs by using script programs, programs consisting 

35 mainly of strung together commands, such as those you 
might issue at a command line. These services add a script 
program to the HTML image source tag, which points to the 
image displayed as the advertisement. (HTML, HyperText 
Markup Language, is the authoring tool used to create 

40 documents on the World Wide Web, Tags are commands, 
generally specifying how a portion of a document should be 
formatted; tags can also refer to the links which allow users 
to move from one Web page to another.) In addition, these 
services add a script program to the anchor tag, the HTML 

45 tag which acts as a link to the advertised site. The first script 
gets invoked when the advertised image is displayed and 
records its view; the second script gets invoked when a 
visitor clicks on the image (to visit the advertised site) and 
records the click. 

50 

PROBLEMS WITH THE PRIOR ART 

One fundamental limitation of existing Web site analysis 
tools is that they rely solely on information in the server Web 

55 log, which is URL-based. Why isn't this enough? A URL 
indicates only the location of a served Web page and often 
very little about its content, particularly if the page in 
question is dynamic (generated from a database, a person- 
alization profile, or search query) or simply no longer exists. 

60 Business people, on the other hand, are interested in the 
content viewed by their audience, not the addresses of that 
content What products are customers looking at? What 
products are they being shown? Do pages contain the 
products in which customers are interested? Is the style of 

65 presentation working? Is there easy access to the informa- 
tion the visitor is looking for? What links on each page did 
the visitor not click on? In an electronic commerce Web site, 
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answers to these kinds of questions can feed back into the of the Web pages in one or more of the metadata fields, and 

architecture and design of the Web site, increase its stores a requester identification, associated with the 

effectiveness, and thereby maximize the return on invest- requester, in the requester field of the record associated with 

ments Unfortunately, it is not straightforward to answer the respective Web page. 

these questions for today's Web sites with existing log 5 i n a preferred embodiment, an aggregation process 

analysis tools. traverses the log to extract one or more of the metadata 

A second limitation of conventional log analysis software fields. A counter set, generated by the aggregation process, 

products is that while they provide the click-throughs of has a plurality of counters. Each counter has a counter object 

hyperlinks, none of them can provide the impressions of (or counter type), a counter event, and a counter value. For 

hyperlinks and conversion rates as do Web advertising io instance, the counter keeps track of how many times a 

banner services. Unfortunately, even the method used by the particular hyperlink, Web page, product, and/or product 

Web advertising services restricts them to collecting impres- category was seen or selected (event) by requesters. A 

sion data only for specific types of hyperlinks such as conversion rate set. also generated by the aggregation 

image -based ad banners, not for text- or form-based hyper- process, has a plurality of rates, where each rate has a rate 

links. Also, this method is costly, because the script pro- 35 object, a rate event and a rate value. For instance, the 

grams need to be invoked on a hyperlink basis (one invo- conversion rate set might track how many times a product/ 

cation for every link), as opposed to on a page basis (one product category was selected with respect to the number of 

invocation for every page). times a particular Web page or hyperlink related to the 

Without the ability to collect, aggregate, and/or analyze selected product/product category was seen, 

detailed information about the interaction of visitors with 20 
Web content, Web designers and marketers currently rely on 

ad hoc knowledge of a few experts in the area (e.g. creative FIG. 1 is a block diagram of one preferred embodiment of 

designers). The current dependence on a few human experts the present system. 

for Web site design and management is evidence that it is piG. 2 is a block diagram of a log record template which 

more of an art than a science, and that there are not sufficient 25 contams re q Ues ter identification information and metadata 

systems or tools for it. This method is expensive, inefficient, about a requested Web page. 

faulty, and subjective. It is often seen that experts express FIG. 3 is a block diagram of a hyperlink field template 

contradictory opinions about the same Web site design. which metadata about a hv p er link on a requested 

OBJECTS OF THE INVENTION 3Q Web page. 

An object of this invention is an improved system and FIG - 4 is an example block diagram of a log record, 

method for logging information about Web requesters and FIG. 5 is an example block diagram depicting source and 

content of Web pages served by a server on a network, metadata specification of a Web page where the source 

particularly a server on the World Wide Web. determines the presentation of the page and the metadata 

SUMMARY OF THE INVENTION 35 8 iv ^^ mat ^ ^ut the page contents. 

FIG. 6 is a now chart or a preferred logger process. 
The present invention is a computer system and method _ . , , , r t t c 

c ,i A . , . *. j . ' c MG. 7 is a block diagram ot a counter set of one or more 

for collecting, analyzing, aggregating, and storing informa- 4 A ° 

counter records 

tion about the content of one or more Web pages served by, rt . , i ' 

a server on a network. In a preferred embodiment, the server 40 8 15 a bloc * dia S ram of a <*> ave ™™ rate set of one 


BRIEF DESCRIPTION OF THE FIGURES 


is on the World Wide Web and, is performing an e-commerce 


or more rate records. 


DETAILED DESCRIPTION OF THE 
INVENTION 


function such as hosting a store that sells products or FIG. 9 is a flow chart of a preferred aggregation process, 

services. FIG. 10 is a flow chart of a preferred counter update 

The server has one or more central processing units, one process, 
or more memories, ad one or more network interfaces 45 FIG. 11 is a flow chart of a preferred conversion rate 
connected to one or more networks. A server process is computation process, 
executed by one or more of the central processing units and 
receives one or more requests for one or more Web pages 
from one or more requesters connected to the network. The 
requests enter the server through one or more of the network 50 FIG. 1 is a block diagram of one preferred embodiment of 
interfaces. Upon receiving the request, the server produces the present system (100) showing one or more Web request- 
each requested Web page from one or more memories, ers (101), a Web server system (150), a network (125) 
serves Web pages to the requester, and continues until all connection (105) between the Web requesters (101) and the 
requested Web pages have been served. The Web pages have Web server system (150), and a data aggregator system 
one or more content elements, blocks of text, images, and/or 55 (160). 

hyperlinks which provide specific information about pre- A Web requester (101) uses a Web browser software, e.g., 

defined areas of interest, in addition to one or more metadata Netscape Navigator or Microsoft Internet Explorer, running 

entries, tags in a meta language which categorize the content on a computer, e.g., an IBM desktop computer, to commu- 

elementsofa Web page. One or more of the metadata entries nicate with one or more of the Web server systems (150) 

are associated with the content elements of the respective 60 over a computer network or a collection of such networks 

Web page produced. In a preferred embodiment, each meta- (125), e.g., the Internet. The Web requester sends one or 

data entry has an entry type and an entry value. m0 re requests for Web pages (102) to the Web server system 

The system creates and maintains a log having a plurality (150) and receives from the Web server system (150) the 

of records. Each record has one or more requester fields and Web pages (103) which are rendered on the requester's 

one or more metadata fields. 55 browser software. 

A logger process is executed by the server process. The In an electronic commerce environment, Web server 

logger process stores the metadata entries contained in each systems (150) provide facilities for commerce activities of 
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Web requesters (101) in addition to the basic Web page metadata (503) attached to the Web page (103) and stores it 

service. In such an environment, Web requesters (101) may (503) in a record (400) in a log (200) along with some 

be able to not only browse information about merchandise information about the Web requester (101) who made the 

and services for sale on the Web server system, but also respective Web page request (102). The server process (110) 

place one or more orders for products or services of their 5 is able to collect the information on the Web requester (101) 

need, pay for the purchased merchandise or services, and from the information embedded in the request (102), and 

request other customer services regarding the purchasing passes requester information to the logger process (600). 

activities. (See the description of FIG. 6, below.) After recording the 

The Web server system (150) is typically a computer metadata (503) in the log (200), the log process (600) gives 

workstation, e.g., an IBM RS/6000 computer, on which two 10 back tne Weo P a g e (103) l6 the server process (110). At this 

or more software processes including a server process (110) point, the logger process (600) may remove the metadata 

and, typically, a logger process (600) are executed. The ( 503 ) from lne w eb page (103), so that the server process 

server process (110) provides a core function of the Web to the requester (101) only the Web page without 

server system (150). It receives one or more requests for * ts metadata (500A, 103). 

Web pages (102) from one or more Web requesters (101), 15 The data aggregator system (160) is a computer 

produces the requested Web pages (103) from the Web page workstation, e.g., an IBM RS/6000 computer, on which one 

repository (140), and sends back each of the Web pages or more software processes including the aggregation pro- 

(103) to the Web requesters (101) over the computer net- cess (900) are executed. Note that the data aggregator 

work or a collection of such networks (125). system (160) may be separate or combined with the Web 

The Web page repository (140) is a computer storage 20 server system (150). The aggregation process (900) 

system, e.g., a file system of an IBM AIX operating system traverses the records (400) in the log (200) one by one, and 

or an IBM DB2 database system, that stores Web pages counts the numbers of distinct occurrences of one or more 

(103) served by the Web server system (150). There are metadata (503) types for a fixed period of time. The process 

sections of the repository (140), static (130) and dynamic records the numbers in one or more counter records (750). 

(115 and 120) that respectively handle two types of Web 25 Th e collection of the counter records (750) builds up a 

pages (103), i.e., static pages (130A) and dynamic pages counter set (700). For example, a counter record may store 

(115 A). Static Web pages (130A) are stored in the repository the number of impressions of an image link of an IBM G42 

in the same form as they are served to the Web requesters monitor product on the Web pages (103) served by the Web 

(101). They are stored in the static Web page database (130). server system (150) for a certain time interval. Also a 

Dynamic Web pages (115A) typically are dynamically gen- 30 counter record (750) may record the number of click- 

erated by the server process (110) only when there is a throughs on an image link of an IBM G42 monitor product 

request for one. The server process (110) extracts one or among the Web pages served, by the Web server system 

more values of one or more parameters embedded in a (150) for a fixed period of time. 

request (102) from a Web requester (101), and uses the The aggregation process (900) may also determine one or 

parameter values to obtain data relevant to the requested 35 more rates by using the numbers recorded in the counter 

Web page from the content database (120). Then the server records (750) in the counter set (700). (See description of 

process (110) selects a relevant Web page template (116) FIGS. 7 and 8, below, for more detail.) Each rate is recorded 

from the Web page template database (115), and constructs in a rate record (850), and the conversion rate set (800) 

the Web page (115 A) by filling the extracted data into the comprises the collection of the rate records (850). For 

Web page template (116). 40 example, a rate record (850) may store the rate of the number 

For example, if the server process (110) receives a request of click-throughs to the number of impressions of an image 

whose URL is www.xxx.com/cgi-bin/e-store/ link of an IBM G42 monitor product among the Web pages 

category?category number=3318, it first extracts the value served by the Web server system (150) for a fixed period of 

3318 for the parameter, category number. Next, it obtains 45 tmie - 

data about the category from the content database (120) by FIG. 2 is a block diagram of a log record (400, 250) 

using 3318 as the data retrieval key. Then, the server process (including a template in a preferred embodiment) which 

(110) selects a category display template (116) from the Web contains requester identification information (225) and 

page template database (115), and fills the retrieved data metadata (220) about a requested Web page (103). The log 

about category 3318 into the template (116) to produce the 5Q (200) comprises a set of these log records (250). Each log 

dynamic Web page (115A) which then can be sent to the record (250) contains information describing the user (225) 

Web requester (101). who requested the Web page (103) as well as a description 

The Web page repository (140) also stores metadata (503) ( 22 °) of the requested Web page. Instantiations of this log 

about Web pages, static (130A) or dynamic (115A). The (200) will be used in the analysis of Web server (150) and/or 

metadata about a Web page is a description of the Web page 55 requester behavior. The structure of this template is thus 

itself and/or its content such as hyperlinks embedded in the carefully designed to capture those pieces of information 

Web page (103). The metadata of a Web page is created, whicn wil1 be most usefu l in the analysis process, 

updated, stored, and retrieved with the corresponding Web In one preferred embodiment, the requester field (210) 

page (115A, 130A). Thus when the server process (110) contains information to identify and classify the user making 

produces a Web page to serve, the server process (110) gets 60 the request. Here there are typically three subfields of 

a Web page (103) with its metadata (500, 503). interest: 

In a preferred embodiment, the logger process (600) is a a unique identifier (212) for a computer or other device on 

software process running on the Web server system (150) in a TCP/IP network, used for routing purposes in accor- 

conj unction with the server process (110). The logger pro- dance with the TCP/IP protocol, i.e., the IP address of 

cess (600) receives every Web page with metadata (500) 65 tne requester (212). 

from the server process (110) before the page (103) is sent a piece of information that uniquely identifies the par- 

to the requester (101). The logger process (600) extracts the ticular session between the client/requester (101) and 


03/23/2003, EAST version: 1.03.0007 


US 6,466,970 Bl 

7 8 

the server (150), e.g., information shared between a information is envisioned. In the electronic commerce 

requester's Web browser and a Web server (150). In a domain, additional information might include product iden- 

preferred embodiment this could be a requester cookie tifier in the case of the product page, or department name in 

(214), originating as a message sent by a Web server to the case of the department page. In a preferred embodiment, 

a Web browser visiting the server's site, subsequently 5 this field has the representative format: 

stored in a local file, and sent back to the server upon referenced Web page description: <page category> 

request. The requester cookie could provide the session <category-specific, metadata> 

identifier (216), the alphanumeric string which The application-specific subfield (340) is used to record 

uniquely identifies the particular session(s) between a information about the hyperlink that is specific to the 

client (101) and a server (150). 10 application. This subfield (340) specifies how this technol- 

an optional visitor classification (218), e.g. a method of °gy is used (applied) in real-world life. The application in 

categorizing visitors according to some predetermined this ca s e is the use (application) of Web technology in the 

classification. e-commerce area. In addition, the subfield (340) may 

The metadata field (220) describes the Web page that is include, but is not limited to, usage and purpose designations 

being requested. It comprises two parts: the Web page 15 f° r tne hyperlink. In the electronic commerce domain, an 

description field (230) and the hyperlink field list (240). The optional merchandising classification field (342) can be used 

Web page description field (230) gives a general description to identify the primary merchandising cue for which the 

of the Web page (103). It (230) typically comprises the Web hyperlink is being used. Examples of merchandising cues 

page URL (232), the referral URL (234), i.e., the URL of the include, but are not limited to, direct, up-sell, cross-sell, and 

page the requester was on when the person requested this 20 promotion. Note that the same hyperlink may be used in 

Web page, and the Web page description (236). In a pre- multiple merchandising scenarios. For example, the same 

ferred embodiment, the Web page description (236) typi- product link could be used for both a direct product link that 

cally comprises a text string that gives a thumbnail charac- might be used for standard catalog browsing, and a cross-sell 

terization of the requested Web page! An example (236A) of in which one product is presented along side another. In a 

the Web page description field is given in FIG. 4 where the 25 preferred embodiment, the representative format for this 

Web page annotated in the log record describes a particular subfield is as follows: 

product category which is laptop computer. merchandising classification: <merchandising cue name> 

The hyperlink field list (240) comprises a set of hyperlink The optional appearance subfield (350) is used to capture 

fields (242) from the requested Web page (103) to some specific information about how the hyperlink is visually 

other Web page. Thus, this list characterizes the types of 30 presented to the user. The media type subfield (352) iden- 

pages which can be accessed from the requested Web page tifies the type of media that was used to represent the 

(103). Because the function of a Web page (103) often lies hyperlink. The media type values are text, image, script and 

mostly in the set of Web pages which can be accessed, this form, but may include others. The font type (354) identifies 

information is included as part of the metadata field (220) in the kind and size of font that was used to represented the 

a preferred embodiment. The hyperlink field description is 35 hyperlink in the case that the media type value was text. The 

defined in detail in the description of FIG. 3 below. A color of text or image links is specified in the color subfield 

particular instantiations of this record template is given in (356). 

the description of FIG. 4 below. This location subfield (358) provides information about 

FIG. 3 is a block diagram of a hyperlink field template where on the page the hyperlink was found. Various methods 

which contains novel metadata about a hyperlink on a 40 of location can be specified and might include, position in a 

requested Web page. The hyperlink field (242) depicted in list, quadrant of a page, or element of a page (i.e. informa- 

FIG. 3 is an entry of the hyperlink field list (240) depicted tion bar, site navigator bar, etc). This subfield will be 

in FIG. 2. A hyperlink is a piece of text, an image, a video represented in the following format: 

clip or any other content clement, that when activated takes location: <locator> <locator data> 

the requester to another content presentation. The hyperlink 45 The <locator> field is used to identify the location method 

field list is used to capture and record information about each and the <locator data> field is used to hold locator-specific 

hyperlink contained within a content presentation or a Web data, for example, list position (n of m), quadrant number (1, 

page. This information enables the invention to assess the 2, 3, etc), or element name (info, nav, etc). • 

efficacy of particular hyperlink features. FIG. 4 depicts an example of one non-limiting specific 

The hyperlink field (242) contains application specific 50 instantiation of the log record template (116) used to create 

information about a hyperlink accessible from the requested the log record (400, 250) shown in FIG. 2. This particular 

Web page (103). The hyperlink field (242) contains an log record (250A) has a specific timestamp (405) stating that 

identifier of the accessible Web pages, e.g. the hyperlink the record was written on Jul. 22, 1998 at 11:33, when the 

URL subfield (332). Typically included with the URL sub- Web page (103) was requested. 

field (332), is the information describing the hyperlink, 55 The requester field of this particular record (210A) has 

which can include information about the Web page being specific values provided. For example, the IP address (212) 

referenced (336), the appearance or the way the hyperlink has the value "9.2.250.16" (212A), the usual sequence of 4 

was presented (350), and some application-specific infor- numbers between 1 and 255, separated by dots; the visitor 

mation (340). classification (218) is that of "frequent visitor" (218A). 

The referenced Web page description subfield (336) is 60 Values are given for the cookie (214A) and the session 

used to record summary information about the Web page identifier (216A) as well. 

being referenced. Information is recorded at a summary The instantiated metadata field (220A) contains an instan- 
level and at a minimum includes the type of the referenced tiated Web page description field (230A) and an instantiated 
page, but may include additional information. A classifica- hyperlink field (240A). The instantiated Web page descrip- 
tion of electronic catalog pages, such as search page, depart- 65 tion field (230 A) gives the URL of the requested Web page 
ment page, product page, home page, information page, etc., (in this case www.xxx.com/cgi-bin/e-slore/ 
could be specified in this subfield. Other classification category?cgrfnbr«3318), the URL from which the request 
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was made (in this case (234A), the user made the request 
while on a search page by typing in the keyword "laptops"). 
The Web page description (236A) tells us that the requested 
Web page is a "product category" page, specifically the 
category Web page describing "laptop" computers. 5 

The instantiated hyperlink fields section (240A) has two 
hyperlink fields, describing Web pages which can be 
accessed from the requested Web page (103). In the first case 
(242A), the Web page is a "product category" page, describ- 
ing "E Series" computers; the merchandising classification 10 
is "promotion", which means that the product is showcased 
as part of a special offer, the location is "6/7", meaning that 
it is sixth in a list of 7 hyperlinks, and the media type is 
"image", meaning that one click on a picture is required in 
order to go to the referenced Web page. The second case 15 
(242 B) provides the same types of information, i.e., the 
URL, referenced Web page description, merchandising 
classification, location and media type as the first case. In 
this case, the merchandising classification is "up-sell", 
which means that the product is a similar but more upscale 20 
item than the items on the current Web page. 

FIG. 5 is an example block diagram depicting source and 
metadata specification of a Web page (500). The diagram 
comprises the following four parts: the declaration of the 
particular markup language used (501), possibly including a 25 
pointer to the file containing the specification of the par- 
ticular language fragment of interest, the metadata descrip- 
tion of the Web page (503), the formatting language source 
code for the Web page (505), and a set of links connecting 
chunks of the metadata description to chunks of the source 30 
(507). 

In the example, the declaration section (501) comprises a 
statement specifying that the markup language used is XML 
(extensible Markup Language), and points to a file contain- 
ing the XML specification, i.e., "page.dtd". Note that, for the 35 
metadata description (503), other types of well known 
markup language can be used, e.g. HTML (HyperText 
Markup Language) and SGML (Standard Generalized 
Markup Language). Typically, markup languages are used to 
insert the sequence of characters or other symbols, often 40 
called "tags", at certain places in a document to indicate how 
the document should look when it is printed or displayed, to 
describe the logical structure of the document, or to provide 
information about the document as shown in the metadata 
description section (503). 45 

The metadata section (503) contains several metadata 
elements which give meta-level descriptions of the various 
content elements of the Web page. For example, the top 
metadata element depicted (510) gives the meta-level 
description of the Web page which gives information on the 50 
type (511) and topic (513) of the Web page; the second 
metadata element (520) gives a meta-level description of a 
hyperlink. Each element contains several entities which give 
the actual meta-level information. For example, the meta- 
data element of the hyperlink (520) contains entries describ- 55 
ing the type (521) and topic (523) of the Web page refer- 
enced by the hyperlink, the merchandising classification of 
the hyperlink (525), the position of the hyperlink (527), and 
the media type of the hyperlink (529). Each entry consists of 
a (type, value) pair; for example, in entry (527), the type is 60 
position and the value is "6/7". 

The source section (505) gives the actual source language 
for the formatting of the Web page. In this example, the 
familiar HTML is used; however, other languages are pos- 
sible. The section comprises content elements (542) which 65 
correspond to elements of the Web page. Examples of 
content elements (542) include, but are not limited to, the 
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header part of the HTML source (540) and the HTML tag of 
a hyperlink (544 and 546). 

A set of links (507) is shown, relating metadata elements 
and the content elements they represent. For example, a link 
(592) connects a metadata element (520) (the description of 
a hyperlink) with a content element (544) (the HTML source 
of the hyperlink). 

The "described by" connections (550) indicate that meta- 
data entries (512) in the metadata section (503) have infor- 
mation about content elements (542) in the source section 
(505). In the example in FIG. 5, each content element (542) 
has a corresponding metadata entry (512). Further some of 
the content elements are hyperlinks and there is novel 
metadata containing information about those hyperlinks 
(503). 

FIG. 6 is a flow chart of a preferred logger process (600) 
which is used to extract and log the metadata (503) associ- 
ated with each Web page (103) as it is served to the requester 

(101) . When the server process (110) receives a request from 
a user (605), the logger process (600) accesses to the server 
process (110) and collects requester data (210A) that will be 
used to populate the requester field (210) of the log record 
(250) for this request (610). In the meantime, the server 
process (110) processes the request (102), and retrieves the 
requested Web page (103) from the Web page repository 
(615, 140). Then the server process forwards the requested 
Web page to the logger process (620). The Web page (500) 
is one as depicted in FIG. 5 and as such includes both HTML 
source section (505) and the metadata section (503) that 
describes the HTML source. 

The logger process (600) extracts the metadata section 
(503) from the Web page, and forwards the remaining 
HTML source section of the page (505) back to the server 
process (625). Then the server process (110) sends the Web 
page (103) back to the requester (101) through the network 
connection (630). 

The logger process (600) parses the metadata section 
(503) and collects metadata entries (512) about the page that 
will be used to populate the metadata field (220) of the log 
record for this request (635). All or part of the metadata 
section (503) may be selected to populate the metadata field 
(220) in the log record (250) of this request. The logger 
process (600) extracts metadata value (503) in a Web page 
(500) and fills in the subfields of the metadata field (220) of 
the log record template (250) with the extracted metadata. 
Note the difference between the metadata value (503) in a 
Web page and the metadata field (220) in a log record 
template. An example of the result of this process is shown 
in a log record (250A) in FIG. 4 whose metadata field 
(220A) is filled with metadata values extracted from the 
metadata section (503) of a Web page (500). The logger 
process creates a new log record (e.g. 250A) for this request 

(102) and populates the requester (e.g. 2 10 A) and metadata 
field (220) of the record (250) with the data it collected 
above (640). 

Given the detailed description of the system and method 
for collecting and analyzing information above, a more 
detailed description of a preferred embodiment for aggre- 
gating and further analyzing this information is now given as 
a preferred embodiment of the invention in the description 
of FIGS. 7-11 below. Note that this aggregation system and 
method has general applications in the analysis of Web site 
performance and is fiirther described and claimed in U.S. 
patent application Ser. No. 09/238,238, entitled AGGRE- 
GATING AND ANALYZING INFORMATION ABOUT 
CONTENT REQUESTED IN A NETWORK (WORLD 
WIDE WEB) ENVIRONMENT TO DETERMINE CON- 
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VERSION RATES, filed on the same day as this invention 
with the same inventorship, which is herein incorporated by 
reference in its entirety. 

FIG. 7 shows a counter set (700) of records generated by 
the aggregation process (900). The purpose of these counter 5 
records (750) is to count the number of occurrences of 
certain events as they relate to certain objects (752) in other 
counter records (750). 

Generally, an object is the entity, or type of information, 
used for computing conversion rates. An object can be either 
a Web page or a hyperlink in a Web site. By using the 
metadata of the Web page and/or hyperlink (particularly, the 
Web page description field (236) for the Web page and 
referenced Web page description field (336) for the 
hyperlink), the Web page and the hyperlink can be given a 
name meaningful to Web site owners, merchandisers and 15 
designers. These names can include: a product name, a 
product category name, a shopping-cart page, a product- 
purchase page, a home page, a search request page, a search 
result page, an index page, etc. In an e-commerce Web site, 
these information types are of particular interest when 20 
computing the conversion rates for products and/or product 
categories. To continue the explanation by example, an 
object can be page-related and/or hyperlink-related. Some 
examples of objects are the information in FIG. 5, e.g. items 
511 and 521. For instance, object values can be the values 25 
(e.g., laptops, E series) of a Web page description and a 
referenced Web page description of metadata entries (513, . 
523) in FIG. 5. 

Later, conversion rates for one or more of these objects 
are determined. Typically, objects are selected because they 30 
are related to revenue of the Web site. They are defined at a 
page-level and/or a hyperlink- level, as well as at the level of 
the entire Web site as a whole. Also objects can be sub- 
classified, e.g., in a product taxonomy like vendor, price, 
color, product type. Typically, a taxonomy is a hierarchical 35 
structure of products by criteria such as department, 
category, vendor, and color. Thus by using the invention, 
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process. Steps in the shopping process include: being intro- 
duced to the product, browsing the information of the 
product, inserting the product into the shopping cart, and/or 
placing an order for the product, as they click on hyperlinks 
of the product associated with each step. 

A counter record (750) in the figure, for example, stores 
the number of page impression events (754) for the home 
page (752) and has 39,341 as its value (756). With this 
invention, it is possible to count the number of occurrences 
of certain events for any object, i.e., Web page, product, 
category, or hyperlink shown on the page, when the Web 
pages are served. In contrast to the prior art, there is no need 
to attach a small counting program to an object of interest 
and invoke the program each time the object is seen or 
clicked. Therefore, the cost for event counting process is 
kept to a minimum. This fact can be used on a large scale to 
count multiple events for multiple objects at the Web page/ 
Web site. Using this invention, a huge variety of events 
(754) can be counted/tracked and eventually used and evalu- 
ated in the aggregation process (900). This invention permits 
the monitoring and tracking of these events, especially the 
events in shopping flow in an e-commerce Web site, and 
helps determine the effectiveness of the Web site, Web pages 
on the site, and/or parts of the Web pages. 

Example types of counter events (754) allowed in a 
counter record (750) include hyperlink impressions, Web 
page impressions, shopping cart impressions, hyperlink 
click-throughs, shopping cart drops, and purchases. These 
are basic activities of shoppers in electronic commerce Web 
sites, and the ability to analyze the patterns of these activities 
is critical for successful product marketing and sales. 

An event (754) can be related to a number of various 
counter objects (752). As a non-limiting example, in the 
table below, an "x" indicates a valid relationship between an 
event (754). and an object (752); and that a counter (750) 
could exist for this relationship. 


Event> 




Hyperlink 


Shopping 



Hyperlink 

Click- 

Web page 

Cart 


Object Type 

Impression 

through 

Impression 

Placement 

Purchase 

product category' 

X 


X 

X 

X 

product 

X 


X 

X 

X 

Web page type such as 

X 


X 



home page 

X 


X 



search results page 

X 


X 



index page 

X 


X 



shopping cart page 

X 


X 



product category page 

X 


X 



product page 

X 


X 



error page 

X 


X 



site map page 

X 


X 



hyperlink 

X 

X 





merchandisers can determined how effective the Web site is Conversion rates (850) describe the likelihood of one 
at the fine granularity of the object level and determine event translating into a second event. FIG. 8 shows a 
conversion rates at these granularities or sub-categories. The 60 conversion rate set (800) of various example conversion 
invention enables relating product taxonomy to their Web rates (850) and shows how the rates (850) are determined 
page presentation and tracking how each product (object) at using the counter records (750) shown in FIG. 7. A rate value 
each taxonomy level is used on a Web site. in a rate record (850) is calculated by dividing the values of 
Generally, the conversion rates measure the effectiveness two counters that share a common object (852). In a typical 
of marketing of a product in an e-commerce Web site, 65 example case (850), the counter value (756) for page imp res- 
representing what percentage of requesters a hyperlink sion events of the home page is divided by the counter value 
retained and carried over to the next step in the shopping (766) for hyperlink impression events of links pointing to 
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the home page object (854). The resulting value (856) is the 
percentage of home page hyperlink impressions that led to 
impressions of the home page. In other words, the percent- 
age of those users who saw the home page out of all those 
who saw links to the home page. A higher conversion rate in 5 
this context means that the links are more effective in getting 
users to click on them. 

Because the aggregation process can efficiently create 
counter sets for any given event and any given object on a 
Web page and/or Web site, this invention can determine the 10 
conversion rates of almost any general type and number of 
objects in a Web site for almost any general type and 
numbers of event. This Ls done with only limited amount of 
performance cost. The result is extending the meaning and 
use of conversion rates in measuring Web, site performance is 
and Web site design. 

As an illustration, a non- limiting example of valid con- 
version rates (850) are described in the table below. An "x" 
indicates that the event counter A can be divided by, or 
otherwise related to, event counter B. 
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The aggregation process (900) requires another type of 
input, i.e., the conversion rate records (905). There is one set 
of conversion rate records (880) for each counter record 
object (881). In the above example, there will be a set of rate 
records for the IBM G42 monitor and another set for the 
business computing category. In each set, each member 
record has; a different event, i.e., conversion rate computa- 
tion method, for the given object. That is, in the rate set of 
the IBM G42 monitor, there is a record (882) for the number 
of page impression divided by the number of hyperlink 
impression, and another (884) for the number of shopping 
cart placement divided by the number of page impression. 
The event (873, 875, and 877) of each record (872, 874, and 
876)) in the business computing category set is similarly 
determined. The conversion rate records (800) again can be 
implemented by using a two<Hmensional array with the first 
dimension representing the rate objects and the second being 
the rate computation methods. Each cell in the array repre- 
sents the conversion rate of interest. The aggregation process 
(900) creates this array of conversion rate records (910). 


EVENT A> 


EVENT B 

Hyperlink 
Impression 
Hyperlink 
Click- through 
Web page 
Impression 
Shopping Cart 
Placement 
Purchase 


Shopping 

Hyperlink Hyperlink Web page Cart 

Impression Click -through Impression Placement Purchase 


FIG. 9 is a flow chart of a preferred aggregation process 
(900). Initially, the process is given the log (200) as input 
(905). In addition, the aggregation process (900) is given the 
counter objects (852) and events (854) of interest as input 
(905). For example, suppose that the user of the aggregation 
process (900) is an owner of an e -commerce Web site which 
sells computer equipment. If the person is interested in 
tracking how the various products in the business computing 
category on this site are navigated and pursued by the 
visitors, she/he will provide the aggregation process (900) 
with the counter records (750) for the products in the 
category and for the category as input. Further, for the sake 
of simplicity, suppose that there is only one product in the 
category, IBM G42 monitor. Then there can be several 
counter records (750) for the IBM G42 monitor, and the 
same number of counter records for the business computing 
category. The object (752) of the first counter record set is 
the IBM G42 monitor, and this counter record set (780) 
contains one record (782, 784, 786, and 788) for each 
possible event for products — hyperlink impression, Web 
page impression, shopping cart placement, purchase, etc. 
The object of the second set (770) is the business computing 
category, and the set also contains one record (772, 774, 776, 
and 778) for each possible event for product categories. 
These counter records (750) can be implemented by using a 
two-dimensional array with the first dimension representing 
the selected counter objects and the second being the 
selected counter events. Each cell in the array represents the 
value of a counter record. The aggregation process (900) 
creates this array of counter records by using the counter 
objects and events of interest given as input (910), 


The aggregation process (900) retrieves a record from the 

40 1°6 (^15)- The process then retrieves metadata fields (220) in 
the log record, i.e., Web page description field (230) or 
hyperlink field (242), one at a time (920). The process 
checks if the field (220) requires any change to any counter 
record value (756), and, if it does, updates one or more 
counter values selected (1000). The details of this step will 

45 be described below in FIG. 10. 

Whenever there is a change in one or more counter values, 
the aggregation process (900) identifies all the conversion 
rate records (850) whose values are computed by using the 
updated counter value (756), and computes those rate values 

50 accordingly (1100). The details of this will be described 
below in FIG. U. 

The aggregation process (900) continues to parse meta- 
data fields (220) in a log record (400, 250) until there is no 
metadata field left in the record (925). Once the aggregation 

55 process (900) finishes a log record (400, 250), then it 
processes the next record (400,250) from the log (200). This 
process normally continues until (935) there is no record 
(400,250) left in the log (930). Also, sometimes a user may 
request the process to stop (935) even when there are 

60 unprocessed records (400, 250) left in the log (200). 

FIG. 10 is a flow chart of a preferred counter update 
process (1000) showing the details of step 1000 in FIG. 9. 
Note that this process is performed against a metadata field 
(220) retrieved from a log record (400, 250). Initially, the 

65 process (1000) checks if the metadata field (220) is a 
hyperlink field (1005). If so, the process identifies the 
matching counter object by scanning the referenced Web 
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page description subfield (1010), and assigns the object in an 
object variable list (1015). If the object is a product, e.g., a 
product item on sale on the e-commerce Web site, then the 
process (1000) stores the category of the product in the 
object variable list. The process (1000) increments the value 5 
of the counter records whose objects are ones stored in the 
object variable list and whose event is the hyperlink impres- 
sion (1020). In this case, the process finishes (1080) at this 
point. 

If the current metadata field (220) is not a hyperlink field, ao 
then the field is a Web page description field (230). In this 
case, the process identifies the matching counter object by 
parsing the Web page description subfield (1025), and 
assigns the object in an object variable list (1030). If the 
object is a product, then the process stores the category of 15 
the product in the object variable list. The process incre- 
ments the value of the counter records whose objects are 
ones stored in the object variable list and whose event is the 
Web page impression (1035). 

Then the process retrieves the referral page URL from the 20 
referral URL subfield (1040). If the URL is internal to the 
site, the process retrieves the log record (400, 250) whose 
Web page URL subfield value matches with this referral 
page URL (1045). Otherwise, the process skips to step 1060 
(1042). From the log record (400, 250), the process identifies 25 
the hyperlink field whose hyperlink URL matches with the 
Web page URL in the current log record (1050). The process 
increments the value of the counter record (750) whose 
object is the identified hyperlink and whose event (754) is 
the hyperlink click-through (1055). 30 

Now the process checks if the Web page description field 
has a shopping cart designation (1060). If so, the process 
increments the value of the counter records (750) whose 
objects are ones in the object variable list, i.e., product and 
category objects, and whose event (754) is the shopping cart 35 
placement (1065). In this case, the process stops at this point 
(1080). 

In case the Web page description field (230) does not have 
a shopping cart designation, the process continues to check 
if the field has a product purchase designation (1070). If so, 40 
the process increments the value of the counter records (750) 
whose objects (752) are ones in the object variable list, i.e., 
product and category objects, and whose event is the pur- 


some predetermined classification. Additionally, a string that 
uniquely identifies the particular session between a client 
and server is usually alphanumeric. 
We claim: 

1. A computer system having one or more central pro- 
cessing units, one or more memories, and one or more 
network interfaces connected to one or more networks, the 
system further comprising: 

a server process, executed by one or more of the central 
processing units, the server process receiving one or 
more requests for one or more Web pages firom one or 
more requesters through one or more network 
interfaces, producing the requested Web pages from 
one or more memories, and further serving each of the 
Web pages with one or more content elements to the 
requester; 

one or more metadata entries, associated with hyperlinks 
on each of the respective Web pages produced, and 
contained in each of the respective Web pages 
produced, each of the metadata entries containing 
information about one or more of the content elements, 
and each metadata entry having an entry type and an 
entry value; 

a log having a plurality of records, each record having one 
or more requester fields and one or more metadata 
fields; 

a logger process, executed by the server process, that 
stores the metadata entries in one or more of the 
metadata fields, and stores a requester identification, 
associated with the requester, in the requester field of 
the record associated with the respective Web page; 

an aggregation process, executed by one or more of the 
central processing units, the aggregation process tra- 
versing the log, extracting one or more of the metadata 
fields; 

a counter set, generated by the aggregation process, 
having a plurality of counters, each counter having a 
counter object, a counter event, and a counter value; 
and 

a conversion rate set, generated by the aggregation 
process, having a plurality of rates, each rate having a 
rate object, rate event and a rate value. 

2. A system, as in claim 1, where the counter object in the 


chase (1075), The process stops at this point (1080) 

FIG. 11 is a flow chart of a preferred conversion rate 45 counter includes one or more of distinct values of metadata 

determination process (1100). This process starts when it is entries which include any one or more of the following: a 

notified of any update in one or more counter records (1105). description of a Web page, a hyperlink including a product 

This process identifies all the conversion rate records (850) category name, a product name, a home page designation, a 

whose object is the same as that of the counter records (750) search page designation, an index page designation, a 

and whose event (854), i.e., conversion rate determination 50 shopping-cart page designation, a product-purchase page 


method, uses the event of the counter record (1110). The 
process then goes over each of the conversion rate record 
(850) and updates its value (856) by applying its rate 
determination method to the new counter values (1115). 

The following are definitions used in the present inven- 
tion: a cross-sell link is a link to a Web page marketing an 
item complementary in function to the item marketed on the 
current Web page; an up-sell link is a link to a Web page 
marketing a similar but more upscale item than the item 


designation, a sitemap page designation, an error page 
designation, a URL of a Web page, a hyperlink, a description 
of a hyperlink appearance including a media type, a font 
type, a color, a location, and a merchandising classification 
55 including a cross-sell, a up-sell and an advertiserment. 

3. A system, as in claim 1, where the counter event in the 
counter includes any one or more of the following: an 
impression of hyperlinks, a number of click-through of 
hyperlinks, an impression of Web pages, a shopping-cart- 


marketed on the current Web page; a personalization link is 60 drops of products, a purchase of products and a product 


a link to a Web page which offers personalized service, such 
as the display of a selected subset of products, based on 
knowledge of a user; a recommender link is a link to a Web 
page which suggests Web pages that are likely to be of 
interest to the user based on knowledge of the user and the 65 
behavior of a larger population; and a requester classifica- 
tion is a method of categorizing requesters according to 


category. 

4. A system, as in claim 1, where the counter value in a 
counter has a number of occurrences of a particular counter 
event of a counter object. 

5. A system, as in claim 1, where the rate object in the rate 
includes one or more of distinct values of metadata entries 
which include any one or more of the following: a descrip- 
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tion of a Web page, a hyperlink including a product category 
name, a product name, a home page designation, a search 
page designation, an index page designation, a shopping-cart 
page designation, a product -purchase page designation, a 
sitemap page designation, an error page designation, a URL 
of a Web page or a hyperlink, a description of a hyperlink 
appearance including a media type, a font type, a color, a 
location, and a merchandising classification including a 
cross-sell, an up-sell, and an advertisement. 

6. A system, as in claim 1, where the rate event in the rate 
describes one or more of methods for computing one or 
more of rate values and includes any lone or more of the 
following: the number of click-throughs of a hyperlink 
divided by the number of impressions of the hyperlink, the 


10 


a URL, a URL of a referral Web page, and a description of 
the respective Web page. 

14. A system, as in claim 13, where the description of the 
respective Web page includes any one or more of the 
following: a home page designation, a product name, a 
product category name, a search page designation, an index 
page designation, a shopping-cart page designation, a 
product-purchase page designation, a sitemap page 
designation, and an error page designation. 

15. A system, as in claim 1, where the logger process 
further removes the metadata entries so that the metadata 
entries of the respective Web page are not served with the 
page. 

16. A process, executing on a computer connected to a 


number of impressions of a Web page divided by the number is network, comprising the steps of: 


of impressions of the Web page, the number of the page 
impression of a product or product category divided by the 
number of the hyperlink impressions of the product or 
product category, the number of the shopping-cart-drops of 
a product or product category divided by the number of the 20 
page impressions of the product or product category, the 
number of the shopping-cart-drops of a product or product 
category divided by the number of the hyperlink impressions 
of the product or product category, the number of the 
purchases of a product or product category divided by the 25 
number of the shopping-cart-drops of the product or product 
category, the number of the purchases of a product or 
product category divided by the number of the page impres- 
sions of the product or product category, and the number of 
the purchases of a product or product category divided by 30 
the number of the hyperlink impressions of the product or 
product category. 

7. A system, as in claim 1, where the rate value in a rate 
includes one or more numbers resulted from the application 

of one or more computing methods described in the rate 35 
event to one or more counter values of a counter whose 
counter object matches with the rate object. 

8. A system, as in claim 1, where the requester identifi- 
cation is any one or more of the following: a cookie, an IP 
address, a session identifier, a (usually alphanumeric) string 40 
which uniquely identifies the particular session between a 
client and server, and a requester classification (a method of 
categorizing requesters according to some predetermined 
classification). 

9. A system, as in claim 1, where the metadata field 
contains one or more hyperlink fields, where each of the 
hyperlink fields comprise any one or more of the following: 
the URL referenced by a hyperlink on the respective Web 
page being served, a description of the respective Web page 
referenced by the hyperlink, an application-specific 
description, and a description of a hyperlink appearance 
when it is rendered by a browser. 

10. A system, as in claim 9, where the description of the 
Web page referenced by the hyperlink includes one or more 
the following: a product category name, and a product name. 

11. A system, as in claim 9, where the description of the 
hyperlink appearance includes one or more of the following: 
a media type, a font type, a location on the rendered Web 
page, and a color. 

12. A system, as in claim 9, where the application-specific 60 
description includes one or more of the following merchan- 
dising classification links: a cross-sell link, an up-sell link, 

a promotion, an advertisement, a search, a direct-link, a 
personalization link, and a recommender link. 

13. A system, as in claim 1, where the metadata field 
further contains a Web page description field, where the Web 
page description field is any one or more of the following: 


45 


50 


55 


65 


receiving one or more requests for one or more Web pages 
from one or more requesters through one or more 
network interfaces; 

producing the requested Web pages from one or more 
memories; 

serving each of the Web pages with one or more content 
elements to; the requester, wherein there are one or 
more metadata entries, associated with hyperlinks on 
each of the respective Web pages produced, and con- 
tained in each of the respective Web pages produced, 
each of the metadata entries containing information 
about one or more of the content elements, and each 
metadata entry having an entry type and an entry value; 

storing the metadata entries in one or more metadata fields 
of a log, wherein the log has a plurality of records, each 
record having one or more requester fields and the 
metadata fields; 

storing a requester identification, associated with the 
requester, in the requester field of the record associated 
with the respective Web page; 

traversing the log, extracting one or more of the metadata 
fields; 

generating a counter set having a plurality of counters, 

each counter having a counter object, a counter event, 

and a counter value; and 
generating a conversion rate set having a plurality of rates, 

each rate having a rate object, rate event and a rate 

value. 

17. A system for determining how a Web site is being used 
comprising: 

means for receiving one or more requests for one or more 
Web pages from one or more requesters through one or 
more network interfaces; 

means for producing the requested Web pages from one or 
more memories; 

means for serving each of the Web pages with one or more 
content elements to the requester, wherein there are one 
or more metadata entries, associated with hyperlinks on 
each of the respective Web pages produced, and con- 
tained in each of the respective Web pages produced, 
each of the metadata entries containing information 
about one or more of the content elements, and each 
metadata entry having an entry type and an entry value; 
means for storing the metadata entries in one or more 
metadata fields of a log, wherein the log has a plurality 
of records, each record having one or more requester 
fields and the metadata fields; 

means for storing a requester identification, associated 
with the requester, in the requester field of the record 
associated with the respective Web page; 
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means for traversing the log, extracting one or more of the 

metadata fields; 
means for generating, a counter set having a plurality of 

counters,, each counter having a counter object, a 

counter event, and a counter value; and 5 
means for generating a conversion rate set having a 

plurality of rates, each rate having a rate object, rate 

event and a rate value. 
18. A computer program product which performs the steps 
of: 10 
receiving one or more requests for one or more Web pages 

from one or more requesters through one or more 

network interfaces; 
producing the requested Web pages from one or more 15 

memories; 

serving each of the Web pages with one or more content 
elements to the requester, wherein there are one or more 
metadata entries, associated with hyperlinks on each of 
the respective Web pages produced, and contained in 20 
each of the respective Web pages produced, each of the 
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metadata entries containing information about; one or 
more of the content elements, and each metadata entry 
having an entry type and an entry value; 

storing the metadata entries in one or more metadata fields 
of a log, wherein the log has a plurality of records, each 
record having one or more requester fields and the 
metadata fields; 

storing a requester identification, associated with the 
requester, in the requester field of the record associated 
with the respective Web page; 

traversing the log, extracting one or more of the metadata 
fields; 

generating a counter set having a plurality of counters, 

each counter having a counter object, a counter e vent, 

and a counter value; and 
generating a conversion rate set having a plurality of rates, 

each rate having a rate object, rate event and a rate 

value. 

* * * * * 
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